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There have been many analyses made of models for equipment inspection, i.e., where a system 
may suffer a breakdown, but such an event is only discovered by an inspection. Most analyses as- 
sume that the time to failure follows a negative exponential law which implies that only periodic 
inspection programs need be considered. Another model which has been analyzed by Barlow, Hunter, 
and Proschan finds the optimal program of inspections when the equipment reliability function is of 
general form, but a particular loss function is given. In this paper we find the optimal periodic in- 
spection program for systems which do not have negative exponential reliability functions. These 
programs have the virtue of simplicity even though they may not be optimal in an absolute sense. 
Besides the periodic inspection programs, we derive results for random inspection programs. 



1. Introduction 

There have been many papers published on the sub- 
ject of determining optimum checkout intervals for 
systems with random failure characteristics, [1— 5]. 1 
The objective of any checkout program is to minimize 
the down time due to system failure. When the re- 
liability function which describes the system's failure 
characteristics is negative exponential the optimal 
checkout policy must be a periodic one. If the system 
reliability function differs from the exponential, one 
does not expect such a simple result to obtain. Indeed 
Barlow, Proschan, and Hunter [3, 4] have devised an 
algorithm for calculating the optimum checkout sched- 
ule for a system with a general reliability function. 
They minimized the total loss incurred by inspection 
costs and the cost of uninspected down time to derive 
an optimum policy. This policy is, however, a difficult 
one to actually compute since all of the inspection 
times are a function of the first inspection time. 
This parameter must be varied in order to find the 
optimal policy. It has been found that the optimal 
inspection periods are extremely sensitive to the 
choice of the first period, hence a good deal of accuracy 
is needed in the calculations. Other studies of the 
problem have used the assumption of an exponential 
reliability function and found the optimal periodic 
solution [1-5]. 

It is the purpose of this paper to find the optimal 
periodic solution for the checkout interval of a sys- 
tem which does not necessarily have an exponential 
reliability function. Clearly, such a policy will not be 
optimal in the sense of Barlow, Proschan, and Hun- 
ter's results. However the periodic inspection pro- 
gram does have the advantage of simplicity and it is 
the sort that would most likely be followed in practice. 
The exact class of policies which will concern us will 
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be denoted by (A, T) policies. The parameter A is the 
inspection interval and T is a time at which the system 
is to be replaced by a new one if it has passed all of the 
previous inspections. Three possibilities may be con- 
sidered: A can be held fixed and T varied; T can be 
held fixed and A varied; or both parameters can be 
varied. In the case of an exponential reliability func- 
tion or no wearout, (A, oo) policies would be the only 
logical ones to consider if inspection is considered to 
be perfect. We may wish to consider this possibility 
for the general reliability function as well. In addition 
to an analysis of periodic inspection policies we shall 
also consider random inspection policies in which the 
periods between successive inspections are identically 
distributed random variables. 



2. The (A, oo) Policy 

Our criterion for determining the best (A, T) policy 
will be the parameter set which maximizes the ex- 
pected operational readiness of the system. Thus, 
implicitly the cost function will be measured in lost 
time; time that is spent in inspection and replace- 
ment. The present mathematical model is therefore 
applicable to computer systems and to other systems 
which are operated continually. 

We assume that system failure is complete, and that 
the system is replaced rather than repaired when 
breakdown occurs. The system failure characteristics 
will be described by a reliability function R(t) (the 
probability that a system installed at £ = will survive 
for a time t or greater). It will be assumed that the 
first moment 



Jo 



R(t)dt 



(1) 



is finite. If the inspection and replacement times are 
assumed to be random variables then one can dis- 
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tinguish between two types of models. In the first, 
inspections are scheduled for times A, 2A, 3A, ... re- 
gardless of the amount of time spent in the last inspec- 
tion. In the second model the beginning of the next 
inspection is scheduled A units of time after the end of 
the present one, i.e., inspection is scheduled after a 
usage period of A. If the inspection and replacement 
times are fixed rather than random the two models are 
formally identical after a redefinition of A. In this 
paper we will treat the case of a usage period of A. 
Finally, we will assume that the system is turned off 
during inspection, i.e., that it does not age during an 
inspection. 

The notation in addition to that already defined is: 

7; = time for a single inspection. 
r r — time for a single replacement. 
6 = probability of not detecting on inspection a system 
that has failed. 

The first case to be studied will be the (A, oo) policy, 
that is, inspection is continued until a system failure 
occurs. Let us consider the history of a single system 
and enumerate the times which will be necessary for 
our analysis. The nonoperational times will be of 
three types: An installment time; ti+1 inspection 
periods, where n is a random variable; and a period 
when the system is inoperative but that condition is 
not yet determined by inspection. There will also be 
n+ 1 periods of operation, at least n of them being of 
duration A, the remaining one being of duration less 
than or equal to A. For a single system the time his- 
tory is summarized in figure 1. Let T be the lifetime 
of a single system measured from its installation to 
the installation of the next system. Then T is the sum 
of two parts: 



T=T +T N 



(2) 



where To is the operational time and T N is nonopera- 
tional time. Barlow and Hunter have shown in [6] 
that the operational readiness, or proportion of time 
that the system is operational, is given in the limit 

of t — > oo by 



P = E(To) 

E(T )+E(T N ) 



(3) 



where E(T) is the expected value of T. This result is 
also a simple consequence of renewal theoretic con- 
siderations, [7]. Hence in order to evaluate P we 
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Figure 1. Representation of the history of a piece of equipment. 



must only calculate E(T ) and E(T N ) for a single system. 
The calculation of E(T ) is a simple one, since it is 
just the operational time till failure. This is just 



E(T ) = fL 



(4) 



where \x has been assumed finite. The calculation of 
E(Tn) is slightly more complicated. 7,v can be written 



T N = T r + T 1 + T 2 +. . . + r„ +1 + £ 



(5) 



where T r is the replacement, or installation time, Ti 9 
T2, . . . , T n +i are the inspection times till just after the 
failure (n is a random variable) and £ is the time lost 
due to faulty inspection. If the expectation of this 
equation is taken, we find 



E(T N ) = T r + TiE(n + 1) + E(Q 



(6) 



where the r's have already been defined. The random 
variable n has a distribution 

Prjexactly n operational periods} = R(n&) 

-*((»+ 1) A). (7) 
Hence E(n-\- 1) is 

E(n + 1) = J (n + 1) [R(nA) -R((n+ 1)A)] (8) 

n=0 

= £ft(/iA) = G(A) 

n=0 

where the finiteness of the first moment insures the 
convergence of this series. Finally, the residual 
nonoperational time is composed of the random time 
between the failure and the first inspection thereafter 
and a random period due to the effect of imperfect 
inspection. The probability density of a system failure 
is —R'(t), which implies that E(Q is given by 



00 f(n+ 1)A 

E(0=-Y ((rc+l)A -*)/?' 



(t)dt 



+ (1-0) £ tfl0 m (Ti + A) 
m=0 

n=0 



= AG(A)- M + 







1-0 



(Ti + A). 



Combining eqs (3), (6), (8), and (9), we find for the 
operational readiness 



P(A)=- 



M 



Tr + fa + A) G(A) + 



■—) 

1-0/ 



1 



(10) 



224 



where we have introduced the dimensionless param- 
eters 



kr = Tr/fJL, \i = Tilfl. 



(11) 



Occasionally useful bounds on P(A) can be obtained by 
use of the inequalities 



A + /u> AG(A)>/x 



(12) 



This is easily derived by reference to figure 2. There 
it is seen that AG(A) is the area under the upper set of 
rectangles, which is greater than the area under the 
curve for R{t). Furthermore the area under the curve 
for R(t) is greater than that under the lower set of 

00 

rectangles. This latter area is, however A ^ R(nA) 

= A(G(A) — 1) from which the upper bound follows. 
The inequalities of eq (12) thus yield bounds on 
P(A) as follows 



] 



Xr+ Ui + 



\x 



^ I 1 
A 1- 



<P(A) 



< 



\ r + U* + 



"JU + l-9/ 



(13) 



If we write this inequality as L(A) < P(A) < i/(A) then 
it is interesting to note that both L(A) and £/(A) have 
maxima. The maximum of L(A) occurs at 



A w = /xV\,(l-0) 

and the maximum of /7(A) occurs at 

A™ 



A' = 



V0 



(14) 



(15) 



Thus the maximum value of P(A) obeys the inequalities 
1 



\ r + 1 + 



i-e) 



<p« 



< 



A i+ M 



6 L X 
0/ 



(16) 



The optimal value of A, call it A , is the solution to 




G(A )+- 1 _ 

and the value of P(A ) is 
P(A ) = 



(x i + Ao)G'(A„) = 0. 



Tr-iTi + AofG'iAo) 



(17) 



(18) 



When Tiljx is small in comparison to 1 we can derive 
an approximate expression for A . If r, were identi- 




FlGURE 2. Plot ofR(t) and upper and tower bounding functions. 

cally equal to zero, that is, if inspection takes no time 
to carry out, then Ao = 0, or continuous monitoring is 
the optimal solution. If n/6 is small but not identically 
equal to zero, it may be supposed that Ao//ul will also be 
a small quantity. Let us then expand G(A ) and 
G'(A ) for small A by means of the Euler-Maclauren 
expansion, [8] 



G( A) = J? K(n A) = ! + ! - ^ A + 0( A 2 



G'(A) = 



JL 

A 2 



fl'(O) 
12 



(19) 



+ 0(A). 



In this approximation the equation for the optimal A 
becomes 

We have neglected terms /?'(0)A and R'(0)ri which are 
presumed small in comparison to terms like 1/2 and 
Ti/ul/A 2 in the present calculation. Equation (20) has 
the solution 



jit V 1 + fJL XflJ 

which generalizes a result found earlier for the ex- 
ponential reliability function [5]. To this approxi- 
mation the value of Pmax is 



'max 



2(1 -fl) 



2X r (l -d)+ (V\,-(l + 0)+ V2(l-0)) 2 



(22) 



In figure 3 we have plotted P(A/T) for the reliability 
function R(t) = exp (— fi/T 2 ) for A/7 1 in the range to 
0.20 and = and = 0.25. It can be observed that 
the variation of the maximum with A/7 1 is negligible in 
the range that we have considered. 

3. Random Inspection Periods 

Thus far we have considered an idealized case in 
which inspections are performed as scheduled. It is 
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FIGURE 3. Plot of P( A) for R(t) = exp (-t 2 /T 2 ) for 6 = 0.25 and \; = 
0.01, \ r = 0.05. 

of some interest to consider the possibility of random 
examinations, since this, rather than perfect control 
over inspection is likely to be a more accurate descrip- 
tion of actual monitoring situations. We will assume 
that the inspection following a given one takes place 
at some time t after the conclusion of the first inspec- 
tion. The time t is a random variable which will be 
described by a probability density function ^(t) with 
a finite first moment v. 

Again, in order to calculate operational readiness we 
may use eq (3) with E(T ) = fx. The expression for the 
nonoperating time given in eq (6) is still valid but it 
requires somewhat more effort to evaluate E(T N ). As 
before we need to evaluate E(n + 1) and £"(£). We con- 
sider E{n + 1) first. For the evaluation of this quantity 
we define a probability density w n (t) by 

w n (t)dt = Pr{sum of n periods of operating time is in 

U, t + dt)}. (23) 

The probability that there are exactly n complete 
operating periods without a system failure is 

ft n = * w n (t)R(t)dt- " W n+ i(t)R(t)dt (24) 

Jo Jo 

since the first integral is the probability that the 
system is reliable through at least n inspections and the 
second integral has a similar interpretation. The 
expectation of n + 1 is therefore 



■{/; 



E(n + 1) = £ (n + 1) j RU) [wn(t) - w n+ i(t)]dt 



Jo 



(25) 



where 

W(t) = ^ W n (t) 
n=\ 

and we have used as a definition 



(26) 



wo(t) = 8(t). (27) 

Since the w n (t) satisfy the recurrence relation 

W n +l(t)= W n (T) y ¥(t-7)dT 



(28) 



Wi{t) = V{t\ 



it is easily verified, by summing both sides of this 
equation, that w(t) is a solution of the renewal equation 



w(t) = y ¥(t) + 



w(j)^{t- 
Jo 



r)dr. 



(29) 



Notice that w(t)dt is the probability that some inspec- 
tion occurs after an amount of operating time between 
t and t + dt. 

Finally, we must evaluate E(Q. Let us assume that 
there is a system failure after r hours of operating 
time. The first component of £ is the time to the im- 
mediately following inspection period. A second 
component will have to be added in case 6 # 0. Let 
the time from the system failure to the immediately 
following inspection be t (in the terminology of re- 
newal theory this is the forward delay time). In order 
to calculate the distribution of the forward delay we 
must take two possibilities into account; either the 
next inspection is the first one, or else the last inspec- 
tion ended in the time interval (x, x + dx) with probabil- 
ity w{x)dx. These last two events combine to give a 
result for the forward delay density at time r, p(r, t), 

p{r, t) = ^(T + t)+\ w(x)^(t + 1 - x)dx. (30) 

jo 

The first component of E(Q is 

Ei(0 = ~ f X dr f" dttp(r,t)R'{T) (31) 

Jo Jo 

where we have averaged over all possible failure times. 
The second component of E{Q is the expected down 
time due to undiscovered failure. The probability 
that it will take exactly n inspections to discover the 
failure is (1 — 0)6 n ~ l and the expected amount of down 
time consumed before the discovery is made is 



E 2 (Q = (l-0)J j d n -i(n-l)(v + T i ) 



= 6(V + Tj) 

1-6 



(32) 



R(t)w(t)dt 



since v is the expected time between two consecutive 
inspections. Combining the expressions for EiiQ, 
E 2 (Q and E(n-\- 1) we have the final result 
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7> + /X + Tj 



■/; 



1 + R(t)w(t)dt 



+ 



g(y + Tj) 

1-0 



Jo ^0 



dttp(T,i)R'{r) 



(33) 



In the particular case of a completely random exami- and the maximum value of P is 
nation policy, 



V(t) = are- (rt 



(34) 



all of the quantities required by eq (33) are readily 
evaluated. It is well known and easily verified that for 
the negative exponential distribution one has 



w(t) = (T, v= l/cr 
p(r, t) = (re- (rt . 



(35) 



Thus we find that 



o-te- (T( dt=-R(r) 
cr 



-f" R'(r)dr\ 

Jo J 

R(t)w(t)dt = cr R(t)dt = a/Ji. 

Jo Jo 

The value of P can be given explicitly as 

o~ 



(36) 



P = 



+ X/o- 7^ + 0-^) + 



1-0 



(37) 



I -6)ii 



where the dimensionless parameters A, and k r have 
been defined before. The solution for the maximizing 
value of cr is 



o- = 



1 



Wwi-fl) 



(38) 
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Figure 4. 



Plot o/P (cr) for random maintenance for ■ 

ki = 0.0f\ r = 0.05. 



-- 0.0, 0.25 and 



Pmax-yi + K+Y^+ 2 \izr0) 



lit d( 



(39) 



not de 



It is interesting to notice that this 
pend on R{t) except through /jl. 

In figure 4 we have plotted P as a function of ct/ul for 
\ r = 0.05, \,- = 0.01, and = and 0.25. It will be ob- 
served that the curve is rather flat around the maximum, 
and the maximum itself does not shift appreciably as 
6 is changed. 

4. The (A, NA) Case 

When the reliability function drops off much more 
quickly than a negative exponential, it is plausible 
that one might want to replace the equipment after a 
specified time rather than continuing to inspect it. 
Even if the reliability function is a negative exponential, 
finite scheduled replacement times would be indicated 
if the quality of inspection is poor, i.e., if is close to 1. 
Quite generally, one might consider a (A, /VA) policy in 
which replacement takes place after /VA units of op- 
erating time. In this way the number /V becomes an 
adjustable parameter in the problem, and the policy 
can be maximized over both A and N. This procedure 
leads to quite complicated expressions. Let us there- 
fore restrict our efforts to a comparison of (A, A) and 
(A, op) policies for particular parameter values. 

For the (A, A) policy we note that 

E(To)=( R(x)dx = tfb) (40) 

Jo 



and 



E{T N ) = Tr + b-E(T ). 



(41) 



These results are to be substituted into eq (3) to obtain 
the expression for P(A) 



P(A) = 



J R(x)dx 

Jo 

A + T r 



(42) 



In figure 5 we have plotted P(A) as a function of 
A/jT for R(t) = exp (—t/T), for the parameter values 
\r = r r /T= 0.05 and \i = Ti/T=0.0l. When = one 
can do better with a (A, °o) policy than with a (A, A) 
policy although when = 0.9 this is no longer the case. 
For other parameter values there may exist an /V such 
that the (A, /VA) policy is better than either the (A, A) 
policy or the (A, oo) policy. The value of P(A) for the 
(A, NA) policy can be calculated from E{T N ) given by 
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Figure 5. 
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Plot of P(A) against A/T for R(x) 
kj = 0.01, k,=0.05. 



- exp (— x/T) for 



The expression of eq (43) is derived by noting that if 
the system fails at time x in the interval [/A, (/+ 1)A] 
then the time loss due to inspections and to down time 
is j A + k( A + n) — x if A: — 1 inspections have been incor- 
rect, accounting for the first set of terms. If all in- 
spections are faulty then the term 1 — 6 is to be omitted 
and the second set of terms results. Finally, if the 
system fails in [(N— 1)A, NA~] there are no inspections 
so that the last term has no 6 dependence. The ex- 
pression for P(A) which results from the use of eqs 
(43) and (44) is so cumbersome that no information can 
be extracted from it without considerable computation. 



I thank the readers Joan Rosenblatt and Joel Levy 
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E{T N ) = T r -{\- 



A-2 N-j-1 
j=0 k=l 



6 k -l 



/ 



O'+DA 



\jb + k(k + Ti)-x]R'(x)dx 



jA 
V-2 



J j± 



O'+DA 



[NA + (N-j-l)n-x]R'(x)dx 



i 



(NA-x)R'(x)dx (43) 



(A 7 - DA 



and 



-/; 



E(T )= R(x)dx. 



(44) 
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